SemEval-2012 Task 4: Evaluating Chinese Word Similarity
نویسندگان
چکیده
This task focuses on evaluating word similarity computation in Chinese. We follow the way of Finkelstein et al. (2002) to select word pairs. Then we organize twenty undergraduates who are major in Chinese linguistics to annotate the data. Each pair is assigned a similarity score by each annotator. We rank the word pairs by the average value of similar scores among the twenty annotators. This data is used as gold standard. Four systems participating in this task return their results. We evaluate their results on gold standard data in term of Kendall's tau value, and the results show three of them have a positive correlation with the rank manually created while the taus' value is very small.
منابع مشابه
MIXCD: System Description for Evaluating Chinese Word Similarity at SemEval-2012
This document describes three systems calculating semantic similarity between two Chinese words. One is based on Machine Readable Dictionaries and the others utilize both MRDs and Corpus. These systems are performed on SemEval-2012 Task 4: Evaluating Chinese Word Similarity.
متن کاملTiantianzhu7: System Description of Semantic Textual Similarity (STS) in the SemEval-2012 (Task 6)
This paper briefly reports our submissions to the Semantic Textual Similarity (STS) task in the SemEval 2012 (Task 6). We first use knowledge-based methods to compute word semantic similarity as well as Word Sense Disambiguation (WSD). We also consider word order similarity from the structure of the sentence. Finally we sum up several aspects of similarity with different coefficients and get th...
متن کاملWord Similarity Computing Based on Hybrid Hierarchical Structure by HowNet
Word similarity computing is one of the most important and fundamental task in the field of natural language processing. Most of word similarity methods perform well in synonyms, but not well between words whose similarity is vague. It confronts the challenge of how to overcome this problem. An approach is proposed to compute Chinese word similarity based on hybrid hierarchical structure by How...
متن کاملAn Approach of Hybrid Hierarchical Structure for Word Similarity Computing by HowNet
Word similarity computing is an important and fundamental task in the field of natural language processing. Most of word similarity methods perform well in synonyms, but not well between words whose similarity is vague. To overcome this problem, this paper proposes an approach of hybrid hierarchical structure computing Chinese word similarity to achieve fine-grained similarity results with HowN...
متن کاملUTD: Determining Relational Similarity Using Lexical Patterns
In this paper we present our approach for assigning degrees of relational similarity to pairs of words in the SemEval-2012 Task 2. To measure relational similarity we employed lexical patterns that can match against word pairs within a large corpus of 12 million documents. Patterns are weighted by obtaining statistically estimated lower bounds on their precision for extracting word pairs from a...
متن کامل